Emergent collective behaviors in a multi-agent reinforcement learning based pedestrian simulation

نویسندگان

  • Francisco Martinez-Gil
  • Fernando Fernández
  • Miguel Lozano
چکیده

In this work, a Multi-agent Reinforcement Learning framework is used to get plausible simulations of pedestrians groups. In our framework, each virtual agent learns individually and independently to control its velocity inside a virtual environment. The case of study consists on the simulation of the crossing of two groups of embodied virtual agents inside a narrow corridor. This scenario permits us to test if a collective behavior, specifically the lanes formation is produced in our study as occurred in corridorswith real pedestrians. The paper studies the influence of different learning algorithms, function approximation approaches, and knowledge transfer mechanisms in the performance of the learned pedestrian behaviors. Specifically, two different RL-based schemas are analyzed. The first one, Iterative Vector Quantization with Q-Learning (ITVQQL) improves iteratively a state-space generalizer based on vector quantization. The second scheme, named TS, uses Tile coding as the generalization method with the Sarsa(λ) algorithm. Knowledge transfer approach is based on the use of Probabilistic Policy Reuse to incorporate previously acquired knowledge in current learning processes; additionally, value function transfer is also used in the ITVQQL schema to transfer the value function between consecutive iterations. The results demonstrate empirically that our RL framework generates individual behaviors capable of emerging the expected collective behavior as occurred in real pedestrians. This collective behavior appears independently of the generalization method used, but depends extremely on whether knowledge transfer was applied or not. In addition, the use of transfer techniques has a notable influence in the final performance (measured in number of times that the task was solved) of the learned behaviors. A video of the simulation is available at the URL: http://www.uv.es/agentes/RL/index.htm

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MARL-Ped: A multi-agent reinforcement learning based framework to simulate pedestrian groups

Pedestrian simulation is complex because there are different levels of behavior modeling. At the lowest level, local interactions between agents occur; at the middle level, strategic and tactical behaviors appear like overtakings or route choices; and at the highest level path-planning is necessary. The agent-based pedestrian simulators either focus on a specific level (mainly in the lower one)...

متن کامل

Multi-agent Reinforcement Learning for Simulating Pedestrian Navigation

In this paper we introduce a Multi-agent system that uses Reinforcement Learning (RL) techniques to learn local navigational behaviors to simulate virtual pedestrian groups. The aim of the paper is to study empirically the validity of RL to learn agent-based navigation controllers and their transfer capabilities when they are used in simulation environments with a higher number of agents than i...

متن کامل

Detection of Primitive Collective Behaviours in a Crowd Panic Simulation Based on Multi-Agent Approach

We propose an approach towards multi-agent system for simulation and detection of primitive collective behaviors emerging from a crowd in panic. This paper presents various works on which our method is based, by methods of planning and decisions allowing emergence of primitive collective behaviors. We present then an implementation in a virtual environment and detection experiments of emergent ...

متن کامل

Collective Robots Navigation by Reinforcement Learning Mechanisms with Common Knowledge Field ––an Approach for Heterogeneous-agents Systems––

In this study, we propose a new approach to realize a reinforcement learning scheme for heterogeneous multiagent systems. In our approach, we treat the collective agents systems in which there are multiple autonomous mobile robots, and given tasks are achieved based on the collective behavior approach. Also, each agent organizes and refines its knowledge for executing its own behaviors by reinf...

متن کامل

Multiagent Supervised Training with Agent Hierarchies and Manual Behavior Decomposition

We present a supervised learning from demonstration system capable of training stateful and recurrent behaviors, both in the single agent and multiagent case. Furthermore, behavior complexity due to statefulness and multiple agents can result in a high dimensional learning space, which can require many samples to learn properly. Our approach, which relies heavily on both per-agent behavior deco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013